Abstract

Higgs boson production in association with two tagging jets will be mediated by 
electroweak vector boson fusion and by gluon fusion processes at the CERN LHC. 
The characteristic distributions for the gluon fusion process are analyzed for the
$H \to W^+W^-$ signal at $m_H = 160$ GeV, with subsequent leptonic decay of the W-pair.
The dominant backgrounds from top-quark pair production, $WWjj$ production and
vector boson fusion processes can be suppressed to a level of $S/B \approx 1/4$, yielding a
highly significant gluon fusion signal with 30 fb$^{-1}$. Analysis of the azimuthal angle
correlations of the two jets provides for a direct measurement of the CP-nature of the
$Htt$ Yukawa coupling which is responsible for the effective $Hgg$ vertex.



1 Introduction 



Higgs boson production in association with two jets has emerged as a promising channel 
for Higgs boson discovery [1, 2, 3, 4] and for the study of Higgs boson properties [5, 6] at
the CERN Large Hadron Collider (LHC). Interest has concentrated on vector-boson fusion
(VBF), i.e. the weak process $qq \to qqH$ which is mediated by t-channel exchange of a W
or Z, with the Higgs boson being radiated off this weak boson. The VBF production cross
section measures the strength of the WWH and ZZH couplings, which, at tree level, require 
a vacuum expectation value for the scalar field. Hence the VBF channel is a sensitive probe 
of the Higgs mechanism as the source of electroweak symmetry breaking. 

Another prominent source of $Hjj$ events are second-order real emission corrections to
the gluon fusion process. Such corrections were first considered in Refs. [7, 8] in the large top
mass limit and have subsequently been evaluated for arbitrary quark masses in the loops
which induce the effective coupling of the Higgs boson to gluons [5]. Some representative
Feynman graphs are shown in Fig. 1. In the $m_t \to \infty$ limit, the NLO QCD corrections
have also recently been calculated [10].




Figure 1: Feynman graphs contributing to $pp \to Hjj$.

For a SM Higgs boson, the generic Hjj cross section from gluon fusion can somewhat 
exceed the VBF cross section of a few pb [5]. This raises the question whether gluon fusion 
induced Hjj events can be used as a source of information for measuring Higgs boson 
properties. For the VBF process, the most promising Higgs signal arises for Higgs boson
masses around the W-pair threshold in the channel $pp \to HjjX$, $H \to W^+W^- \to \ell^+\ell^-\not{p}_T$ [2, 4].
This prompts us to investigate the gluon fusion contribution to this channel for a Higgs
boson mass of $m_H = 160$ GeV. We analyze potential background processes, in particular
the production of top-quark pairs in association with additional jets, QCD-induced $WWjj$
events and VBF processes, and we show in a parton level analysis that the gluon fusion
induced Higgs production can be isolated as a highly significant signal with 30 fb$^{-1}$ of LHC
data and with a signal to background ratio of about one to four.

The resulting signal is large enough to derive Higgs boson properties from distributions. 
In this paper we focus on the CP properties of the Yukawa couplings to fermions, which are 
given by 

$$\mathcal{L}_Y = y_f\, H\, \bar\psi_f \psi_f + i\, \tilde y_f\, A\, \bar\psi_f \gamma_5 \psi_f , \qquad (1)$$

where H and A denote (pseudo)scalar Higgs fields which couple to fermions $f = t, b, \tau$ etc.
Via these Yukawa couplings, quark loops induce effective couplings of the Higgs boson to
gluons. In our numerical analysis we consider couplings of SM strength, $y_f = \tilde y_f = m_f/v = y_f^{SM}$.
In this case the quark loops are dominated by the top quark, and the Higgs-gluon
coupling can be described by the effective Lagrangian [7, 8]

$$\mathcal{L}_{\rm eff} = \frac{y_t}{y_t^{SM}} \cdot \frac{\alpha_s}{12\pi v} \cdot H\, G^a_{\mu\nu} G^{a\,\mu\nu} + \frac{\tilde y_t}{y_t^{SM}} \cdot \frac{\alpha_s}{16\pi v} \cdot A\, G^a_{\mu\nu} G^a_{\rho\sigma}\, \varepsilon^{\mu\nu\rho\sigma} , \qquad (2)$$

where $G^a_{\mu\nu}$ denotes the gluon field strength. The effective Lagrangian approximation provides
an excellent description of the full results for $Hjj$ production, provided one considers modest
jet transverse momenta, $p_{Tj} \lesssim m_t$, and Higgs boson masses well below the top quark pair
production threshold [9]. We employ the effective Lagrangian description throughout this
paper. From the effective Lagrangian emerge $Hgg$, $Hggg$ and also $Hgggg$ vertices, which
correspond to triangle, box and pentagon top quark loops as in Fig. 1.

The structure of the left diagram in Fig. 1 is very similar to the process of Higgs production
in vector boson fusion [1]. In Ref. [11] it was shown that the distribution of the azimuthal
angle between the two jets in $Hjj$ events can be used to determine the tensor structure of the
$HVV$ coupling ($V = W^\pm, Z$). The same method can be applied to Higgs+2 jet production
in gluon fusion. Here, the azimuthal angle distribution is sensitive to the tensor structure 
of the effective Hgg coupling, which is determined by the CP-structure of the top Yukawa 
coupling. More precisely, neglecting terms which vanish upon contraction with the conserved 
quark currents, the tensor structure of the $Hgg$ vertex which emerges from Eq. (2) is given by

$$T^{\mu\nu} = a_2\,(q_1\cdot q_2\, g^{\mu\nu} - q_1^\nu q_2^\mu) + a_3\, \varepsilon^{\mu\nu\rho\sigma} q_{1\rho}\, q_{2\sigma} , \qquad (3)$$

where $q_1$ and $q_2$ are the four-momenta of the two gluons. The scalar form factors $a_2$, $a_3$ are
directly related to the Yukawa interactions of Eq. (1),

$$a_2 = \frac{y_t}{y_t^{SM}} \cdot \frac{\alpha_s}{3\pi v} , \qquad a_3 = -\frac{\tilde y_t}{y_t^{SM}} \cdot \frac{\alpha_s}{2\pi v} . \qquad (4)$$

Note that $|a_3| = \frac{3}{2}\,|a_2|$ for $y_t = \tilde y_t$. Therefore the cross section for the case of a purely
CP-odd Yukawa interaction will be about $1.5^2 = 2.25$ times larger than the Standard Model
cross section. In contrast to vector boson fusion, there are additional contributions from 
Higgs plus three and four gluon couplings. These may dilute the sensitivity to the structure 
of the Hgg vertex and we need to find the regions of phase space with the best analyzing 
power for differences between CP-even and CP-odd Yukawa couplings. 

The paper is organized as follows. In Section 2 we briefly describe the parton level Monte
Carlo programs with which we determine the relevant cross sections. All calculations are
done at tree level. We then determine the characteristic distributions of the gluon fusion
signal and of the various backgrounds in Section 3 and devise cuts which provide a reasonable
signal to background ratio and a high statistical signal significance. The measurement of
the CP structure of the top Yukawa coupling is addressed in Section 4. We determine the
analyzing power of the azimuthal angle correlations between the tagging jets and find that
it becomes more pronounced for larger rapidity separations of the jets. Final conclusions are
drawn in Section 5.



2 Calculational tools



We consider Higgs+2 jet production in gluon fusion (GF) with the Higgs decaying into a
pair of W bosons, which in turn decay leptonically into electrons and muons ($\ell = e, \mu$)
and the associated neutrinos: $pp \to HjjX$, $H \to W^+W^- \to \ell^+\ell^-\nu\bar\nu$. The dominant
backgrounds are from top-pair production, $pp \to t\bar t X$, and from $t\bar t$ production with additional
jets, $pp \to t\bar t jX$, $pp \to t\bar t jjX$. Another background is $W^+W^-$ pair production with two
accompanying jets, $pp \to W^+W^-jjX$. This can be a QCD-induced process of order $\alpha^2\alpha_s^2$, or it
may arise from vector boson fusion, i.e. electroweak processes of the type $qq \to qqW^+W^-$
at order $\alpha^4$. We do not consider backgrounds from $Zjj$, $Z \to \tau^+\tau^-$ and from production
because they have been shown to be small in the analysis of Ref. [3]. All signal and
background cross sections are determined in terms of full tree level matrix elements for the 
contributing subprocesses and are discussed in more detail below. Detector resolution effects 
are neglected in the following. 

For all our numerical results we simulate pp collisions at a center of mass energy of
$\sqrt{s} = 14$ TeV. Standard Model parameters are set to $\sin^2\theta_W = 0.23105$, $M_Z = 91.187$ GeV and
$G_F = 1.16637\cdot 10^{-5}$ GeV$^{-2}$, which translates into $M_W = 79.962$ GeV and $\alpha(M_Z) = 1/128.92$
when using the tree-level relations between these input parameters. For all QCD effects, the
running of the strong coupling constant is evaluated at leading order, with $\alpha_s(M_Z) = 0.1298$.
We employ CTEQ6L1 parton distribution functions throughout [12].
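
The derived electroweak parameters can be cross-checked numerically. The sketch below assumes that the "tree-level relations" are the standard ones, $M_W = M_Z\cos\theta_W$ and $\alpha = \sqrt{2}\,G_F M_W^2 \sin^2\theta_W/\pi$; the function name is illustrative and not taken from any program used in the analysis.

```python
import math

def tree_level_ew(sin2_theta_w: float, m_z: float, g_fermi: float):
    """Return (M_W in GeV, 1/alpha) from the assumed tree-level relations."""
    # M_W = M_Z * cos(theta_W)
    m_w = m_z * math.sqrt(1.0 - sin2_theta_w)
    # alpha = sqrt(2) * G_F * M_W^2 * sin^2(theta_W) / pi
    alpha = math.sqrt(2.0) * g_fermi * m_w**2 * sin2_theta_w / math.pi
    return m_w, 1.0 / alpha

# Input values as quoted in the text
m_w, inv_alpha = tree_level_ew(0.23105, 91.187, 1.16637e-5)
print(f"M_W = {m_w:.3f} GeV, 1/alpha = {inv_alpha:.2f}")
```

With the quoted inputs this reproduces the values $M_W = 79.962$ GeV and $1/\alpha = 128.92$ given above.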

2.1 The H + jj signal process 

The production of a Higgs boson in gluon fusion in association with two jets, at order $\alpha_s^4$,
can proceed via the subprocesses [3, 8]

$$qq' \to qq'H , \qquad qg \to qgH , \qquad gg \to ggH , \qquad (5)$$

and all crossing related processes. The calculation of this process is based on the work of
Ref. [9]. Instead of full top quark loops we express the matrix elements in terms of effective
Higgs-gluon vertices as given by Eqs. (2, 3). We include CP-even and CP-odd Higgs couplings
and any interference between them. In addition the program was extended with the matrix
elements for the Higgs decay $H \to W^+W^- \to \ell^+\ell^-\nu\bar\nu$.

Throughout the analysis we use the Standard Model coupling and branching ratio for
the Higgs decay into $W^+W^-$, $B(H \to W^+W^-) = 0.912$ for a Higgs boson mass of $m_H = 160$ GeV.
For the signal process, the factorization scale is chosen as $\mu_f = \sqrt{p_{T1}\cdot p_{T2}}$, where
$p_{T1/2}$ denote the transverse momenta of the two jets. The strong coupling constant is taken
as $\alpha_s^4 = \alpha_s^2(m_H)\cdot\alpha_s(p_{T1})\cdot\alpha_s(p_{T2})$.
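
As an illustration of these scale choices, the following sketch evaluates the coupling factor for a given pair of tagging jets. It assumes the standard one-loop running formula with $n_f = 5$ active flavours (the text only states that the running is evaluated at leading order) and reads the factorization scale as the geometric mean of the two jet transverse momenta; all function names are hypothetical.

```python
import math

M_Z = 91.187         # GeV, as quoted in the text
ALPHA_S_MZ = 0.1298  # LO value of alpha_s(M_Z) quoted in the text

def alpha_s_lo(mu: float, n_f: int = 5) -> float:
    """One-loop running coupling at scale mu (GeV); n_f = 5 is an assumption."""
    b0 = (33.0 - 2.0 * n_f) / (12.0 * math.pi)
    return ALPHA_S_MZ / (1.0 + b0 * ALPHA_S_MZ * math.log(mu**2 / M_Z**2))

def signal_scales(pt1: float, pt2: float, m_h: float = 160.0):
    """Return (mu_f, alpha_s^4) for tagging jets with transverse momenta pt1, pt2 (GeV)."""
    mu_f = math.sqrt(pt1 * pt2)  # assumed reading: geometric mean of the jet pT
    alpha_s4 = alpha_s_lo(m_h) ** 2 * alpha_s_lo(pt1) * alpha_s_lo(pt2)
    return mu_f, alpha_s4

mu_f, alpha_s4 = signal_scales(40.0, 90.0)
print(f"mu_f = {mu_f:.1f} GeV, alpha_s^4 = {alpha_s4:.3e}")
```

Evaluating one factor of $\alpha_s$ at each jet's transverse momentum means that softer jets enter with a larger coupling than a fixed scale $m_H$ would give.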

2.2 The QCD $t\bar t$ + jets backgrounds



Given the Higgs decay signature, the main physics background to the $\ell^+\ell^-jj\not{p}_T$ signal arises
from $t\bar t$ + jets production, due to the large production cross section at the LHC and because
the branching ratio $B(t \to Wb)$ is essentially 100%. The basic process we consider is $pp \to t\bar t$,
which can be either $gg$- or $q\bar q$-initiated, with the former strongly dominating at the LHC.
Real emission QCD corrections lead to $t\bar t + j$ and $t\bar t + jj$ events. Relevant subprocesses for
the latter are

$$q\bar q \to t\bar t g , \quad gg \to t\bar t g , \quad q\bar q \to t\bar t gg , \quad gg \to t\bar t gg , \quad gg \to \ldots \qquad (6)$$

and all crossing-related processes [13].

We calculate the $t\bar t$, $t\bar tj$ and $t\bar tjj$ processes at leading order. Hence, for $t\bar t$+jets one has to
avoid the phase space regions where the massless jets get soft or collinear with the b quarks,
in order to have a finite cross section. In addition, double counting has to be avoided when
combining the three background processes. This is achieved in the following way. For the $t\bar tjj$
case, we require both tagging jets to arise from massless partons in our simulation. Similarly
for $t\bar tj$ production, exactly one tagging jet is allowed to arise from a b quark. Finally, the $t\bar t$
cross section corresponds to both tagging jets arising from b quarks. With this prescription
there is no double counting and the cross sections are finite when applying the cuts described
below in Section 3.

The calculation has been performed using the program of Ref. [13]. The decays of the top
quarks and W's are included in the matrix elements, taking into account all off-shell effects
for the leptonic final state [14]. In all cases, the factorization scale is chosen as $\mu_f = \min(E_T)$
of the top quarks and additional jets. The overall strong coupling constant factors for the
LO $t\bar t$+n jet cross section are calculated as $(\alpha_s)^{n+2} = \prod_{i=1}^{n+2} \alpha_s(E_{T,i})$, where the product runs
over n massless partons and the two top quarks.

2.3 The EW WW + jj background 

This background arises from $W^+W^-$ bremsstrahlung in quark-(anti)quark scattering via
t-channel electroweak boson exchange, with subsequent decay $W^+W^- \to \ell^+\ell^-\not{p}_T$, i.e.

$$qq' \to qq'W^+W^- \to qq'\,\ell^+\ell^-\not{p}_T \qquad (7)$$

and crossing related processes. The process was calculated with the program VBFNLO
[11, 15, 16, 17], which also allows the calculation of NLO corrections to distributions. We only use
the tree level option, however. In Ref. [11] it was shown that NLO effects are minimized
by the factorization scale choice $\mu = Q$ at tree level, where Q is the momentum transfer
of the t-channel electroweak boson. With this choice higher order QCD effects are well 
below 10%. This EW $WW + jj$ background also includes Higgs production in VBF. In fact,
for $m_H = 160$ GeV it is dominated by the Higgs contribution, which we here consider as
a background to the observation of $Hjj$ production in gluon fusion. In the following we
collectively refer to the VBF Higgs contribution as well as to continuum $WW$ production in
VBF as the "EW $WWjj$" background.

2.4 The QCD WW + jj background 

Real-emission QCD corrections to $W^+W^-$ production give rise to $W^+W^-jj$ events. These
background processes include [18]

$$qq' \to qq'W^+W^- , \qquad qg \to qgW^+W^- , \qquad (8)$$

which are dominated by t-channel gluon exchange, and all crossing related processes. The
calculation has been done in the framework of the VBFNLO program by employing matrix
elements that have been generated with MadGraph. We call these processes collectively
the "QCD $WWjj$" background. The factorization scale is chosen as $\mu_f = \min(p_{T1}, p_{T2})$ of the
two final state partons. The strong coupling constant factor is taken as $\alpha_s^2 = \alpha_s(p_{T1})\cdot\alpha_s(p_{T2})$,
i.e. the transverse momentum of each additional parton is taken as the relevant scale for its
production.



3 Cross sections at the LHC 

The gluon fusion induced $pp \to HjjX$, $H \to W^{(*)}W^{(*)} \to \ell^+\ell^-\nu\bar\nu$ signal is characterized
by two high-$p_T$ tagging jets and the W decay leptons ($e$, $\mu$). Here, the tagging jets are
defined as the two highest transverse momentum jets in an event. Thus, the signal characteristic is
similar to the VBF Higgs signal [1]. However, one cannot simply follow the same search
strategy as for VBF. The reason is illustrated in Fig. 2, which shows the rapidity separation,
$\Delta\eta_{jj} = |\eta_{j1} - \eta_{j2}|$, of the two tagging jets and the dijet invariant mass, $m_{jj}$, for the signal
and the background processes. The three $t\bar t$ + jets backgrounds have been combined for
clarity, even though their individual distributions are slightly different. The shape of the
distributions for $H + jj$ in VBF is very different from that for $H + jj$ in GF, which resembles
the background distributions. This is because the GF signal shares the characteristics of
QCD processes, which are dominated by external gluons. So while these observables are
very powerful in the VBF analysis, they are almost useless for separating the GF signal from
the background. Therefore, we have to expect a significantly worse signal to background
ratio compared to the VBF analyses. In the following we optimize cuts for a Higgs boson
mass of $m_H = 160$ GeV. For the calculation of the signal and background processes we
impose the following minimal cuts:

$$p_{Tj} > 30\ \mathrm{GeV}, \qquad |\eta_j| < 4.5, \qquad |\eta_{j1} - \eta_{j2}| > 1.0,$$
$$p_{T\ell} > 10\ \mathrm{GeV}, \qquad |\eta_\ell| < 2.5, \qquad \Delta R_{j\ell} = \sqrt{(\eta_j - \eta_\ell)^2 + (\phi_j - \phi_\ell)^2} > 0.7 . \qquad (9)$$

Thus, the jets are required to have a transverse momentum of more than 30 GeV and a
rapidity below 4.5 to be detected in the hadronic calorimeter. In the same way the two
charged leptons are required to have at least 10 GeV of transverse momentum and they
should be sufficiently central in order to provide tracking information. Furthermore the
jets and leptons are forced to be well separated from each other. Below we will argue for
a substantially higher $p_T$ threshold for the charged leptons. Hence, standard LHC lepton
triggers will have a very high efficiency for these W-pair events.

Figure 2: Normalized rapidity separation (left) and dijet invariant mass (right) distribution
of the two tagging jets for the signal (solid) and backgrounds: EW $W^+W^-jj$ (dash-dotted),
$t\bar t$ + jets (dashed) and QCD $W^+W^-jj$ (dotted). The cuts of Eq. (9) are imposed.

At the level of the inclusive cuts of Eq. (9) we require only a modest rapidity separation
of $\Delta\eta_{jj} > 1$ for the two tagging jets, which are defined as the two highest $p_T$ jets of an event.
In contrast to the VBF studies, we do not require the two leptons to lie between these two 
tagging jets. Instead we will focus on angular and mass cuts of the Higgs decay products to 
isolate the signal. 
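
The inclusive cuts can be written as a simple event filter. The sketch below is a minimal illustration, assuming each jet and lepton is described by its transverse momentum, rapidity and azimuthal angle; the `Particle` class and the function names are hypothetical and not part of the Monte Carlo programs used here.

```python
import math
from dataclasses import dataclass

@dataclass
class Particle:
    pt: float   # transverse momentum in GeV
    eta: float  # rapidity
    phi: float  # azimuthal angle in radians

def delta_phi(phi1: float, phi2: float) -> float:
    """Azimuthal difference wrapped to the interval (-pi, pi]."""
    dphi = (phi1 - phi2) % (2.0 * math.pi)
    return dphi - 2.0 * math.pi if dphi > math.pi else dphi

def delta_r(a: Particle, b: Particle) -> float:
    """Separation in the rapidity-azimuth plane."""
    return math.hypot(a.eta - b.eta, delta_phi(a.phi, b.phi))

def passes_inclusive_cuts(jets, leptons) -> bool:
    """jets: the two tagging jets; leptons: the two charged leptons."""
    j1, j2 = jets
    if any(j.pt <= 30.0 or abs(j.eta) >= 4.5 for j in jets):
        return False
    if abs(j1.eta - j2.eta) <= 1.0:          # modest rapidity separation
        return False
    if any(l.pt <= 10.0 or abs(l.eta) >= 2.5 for l in leptons):
        return False
    # every jet-lepton pair must be well separated
    return all(delta_r(j, l) > 0.7 for j in jets for l in leptons)

jets = [Particle(50.0, 1.2, 0.3), Particle(40.0, -0.8, 2.5)]
leptons = [Particle(35.0, 0.1, -1.0), Particle(32.0, -0.2, -1.8)]
print(passes_inclusive_cuts(jets, leptons))
```

Note that no requirement is placed on the position of the leptons relative to the tagging jets, in line with the strategy described above.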

The cross sections resulting for the cuts of Eq. (9) are shown in the first line of Table 2.
The signal cross section of 115 fb (which includes the branching ratios into leptons) is quite
sizeable. The QCD $WWjj$ cross section is about 3 times higher whereas the VBF process
reaches 2/3 of the signal rate. The worst source of background arises from the $t\bar t$ processes,
however, with a combined cross section of about 18 pb.

In order to reduce this large $t\bar t$ background it is necessary to make use of a b-veto, that
is, to discard all events where one or both jets are tagged as b jets. We allow for an overall
mistagging probability of 10% for light partons with $p_T > 30$ GeV and $|\eta| < 2.4$, which leads
to an acceptable reduction of less than 20% for the signal and all non-$t\bar t$ backgrounds. We
use the results of the CMS analysis of Ref. [20] for our assumptions on b-veto efficiencies
and mistagging probabilities. For a 10% mistagging probability per jet one finds b-veto
efficiencies in the range of 60% - 75%, depending on jet transverse momentum and rapidity
($p_T$, $\eta$) as shown in Table 1. With the b-veto, the top backgrounds are reduced by factors of
3 to 8 as shown in line 2 of Table 2. The b-veto is less efficient for the $t\bar tjj$ and $t\bar tj$ processes
than for the $t\bar t$ process since, by definition of the tagging jets, the $p_T$ of the b quark becomes
smaller the more massless jets are radiated. This is illustrated in the left plot of Fig. 3. The
curve for the $t\bar t$ process shows the cut at 30 GeV of Eq. (9) because in this case both b-jets
are tagging jets. For the $t\bar tj$ and $t\bar tjj$ processes either one or both b-jets are distinct from
the two tagging jets and therefore the $p_T$ distributions continue below 30 GeV.

Table 1: The assumed b-tagging efficiencies as a function of jet $p_T$ and rapidity. From
Ref. [20].

                       30 GeV < p_T < 50 GeV    p_T > 50 GeV
1.4 < |eta| < 2.4               60%                 70%
|eta| < 1.4                     65%                 75%
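
The effect of the b-veto on a single event can be sketched as a survival probability. The example below assumes the following reading of Table 1: 60% (70%) for $1.4 < |\eta| < 2.4$ and 65% (75%) for $|\eta| < 1.4$ in the lower (higher) $p_T$ bin, with no tagging outside $|\eta| < 2.4$ or below 30 GeV. It further assumes that the tagging decisions for the two jets are independent; the function names are illustrative only.

```python
MISTAG = 0.10  # mistagging probability for light partons, as quoted in the text

def b_tag_efficiency(pt: float, eta: float) -> float:
    """Assumed b-tagging efficiency for a b jet (our reading of Table 1)."""
    abs_eta = abs(eta)
    if pt <= 30.0 or abs_eta >= 2.4:
        return 0.0  # outside the tagging acceptance
    central = abs_eta < 1.4
    if pt < 50.0:
        return 0.65 if central else 0.60
    return 0.75 if central else 0.70

def veto_survival(jets) -> float:
    """jets: list of (pt, eta, is_b). Probability that no tagging jet is b-tagged."""
    prob = 1.0
    for pt, eta, is_b in jets:
        in_acceptance = pt > 30.0 and abs(eta) < 2.4
        if is_b:
            tag = b_tag_efficiency(pt, eta)
        else:
            tag = MISTAG if in_acceptance else 0.0
        prob *= 1.0 - tag
    return prob

# two b jets as tagging jets (ttbar-like) versus two light jets (signal-like)
print(veto_survival([(60.0, 0.5, True), (40.0, 2.0, True)]))
print(veto_survival([(60.0, 0.5, False), (40.0, 2.0, False)]))
```

In this picture an event with two light tagging jets inside the acceptance survives with probability 0.81, consistent with the quoted reduction of less than 20% for the signal.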



Figure 3: Left: Normalized transverse momentum distribution of the b-quarks for $t\bar t$ (solid),
$t\bar t + j$ (dashed) and $t\bar t + jj$ (dotted). Right: Normalized distribution of the missing transverse
momentum for signal and backgrounds as in Fig. 2. The cuts of Eq. (9) are imposed.

A significant difference between signal and background is provided by the angle between
the charged decay leptons [21]. In the case where the W pair is produced via the Higgs
decay, the W spins are anti-correlated, so the leptons are preferentially emitted in the same
direction, close to each other. A large fraction of the backgrounds does not have anti-correlated
W spins. This difference is demonstrated in Fig. 4, which shows the R-separation
$\Delta R_{\ell\ell}$ and the dilepton invariant mass $m_{\ell\ell}$. The invariant mass can be expressed by

$$m_{\ell\ell}^2 = 2 E_{\ell_1} E_{\ell_2} (1 - \cos\theta_{\ell\ell}) \qquad (10)$$

with $E_{\ell_{1/2}}$ and $\theta_{\ell\ell}$ being the lepton energies and the dilepton opening angle, respectively. Hence,
a small opening angle also leads to small $m_{\ell\ell}$ values, which is the case for the Higgs signal as
compared to the backgrounds [22]. In Fig. 4 the distributions for the electroweak $W^+W^-jj$
background are very similar to the signal distributions because the EW $W^+W^-jj$ process
is dominated by Higgs production. We exploit these features by imposing the following
lepton-pair angular and mass cuts:

$$\Delta R_{\ell\ell} < 1.1 , \qquad m_{\ell\ell} < 75\ \mathrm{GeV} . \qquad (11)$$

The results after these cuts are shown on the third line of Table 2. The signal and the EW
$W^+W^-jj$ cross sections are cut in half but the other backgrounds are reduced by roughly
one order of magnitude.

Figure 4: Normalized distributions of the charged leptons R-separation (left) and dilepton
invariant mass (right) after cuts of Eq. (9) for signal and background as in Fig. 2.
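
The relation between opening angle and dilepton mass can be made concrete with a short numerical sketch. It assumes massless leptons, for which the squared invariant mass equals $2E_1E_2(1-\cos\theta)$; the function names are illustrative.

```python
import math

def dilepton_mass(e1: float, e2: float, theta: float) -> float:
    """Invariant mass (GeV) of two massless leptons with energies e1, e2 (GeV)
    and opening angle theta (radians)."""
    return math.sqrt(2.0 * e1 * e2 * (1.0 - math.cos(theta)))

def passes_lepton_pair_cuts(delta_r_ll: float, m_ll: float) -> bool:
    """Angular and mass cuts on the lepton pair: Delta R < 1.1 and m_ll < 75 GeV."""
    return delta_r_ll < 1.1 and m_ll < 75.0

# Same lepton energies, increasingly large opening angle
for theta_deg in (20, 60, 120, 180):
    m = dilepton_mass(40.0, 40.0, math.radians(theta_deg))
    print(f"theta = {theta_deg:3d} deg  ->  m_ll = {m:6.1f} GeV")
```

For fixed lepton energies the mass grows monotonically with the opening angle, which is why the two cuts select a similar, but not identical, region of phase space.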

Table 2: Signal rates for $m_H = 160$ GeV and corresponding background cross sections, in
pp collisions at $\sqrt{s} = 14$ TeV. Results are given for various levels of cuts and are labeled
by equation numbers discussed in the text. All rates are given in fb. The last two columns
give the signal to background ratio S/B and the $S/\sqrt{B}$ ratio for an assumed integrated
luminosity of 30 fb$^{-1}$.

cuts                                GF      EW WWjj   tt      ttj     ttjj    QCD WWjj   S/B      S/sqrt(B)
inclusive cuts (9)                  115.2   75.1      6832    9518    1676    363        1/160    4.6
+ b veto                            99.2    67.4      833     1822    564     307        1/36     9.1
+ Delta R_ll, m_ll cuts (11)        55.8    30.7      104     218     86.4    42.7       1/8.6    13.9
+ p_Tl cut (12)                     41.5    22.3      38.3    87.7    29.2    20.5       1/4.8    16.2
+ m_T^WW cuts (15, 16)              37.1    19.9      30.1    63.4    19.3    13.4       1/3.8    16.8
+ missing p_T cut (17)              31.5    16.5      23.3    51.1    11.2    11.4       1/3.6    16.2
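
The last two columns of Table 2 follow from the cross sections by simple arithmetic. The sketch below recomputes them, assuming that B is the plain sum of the five background columns and that event numbers are cross section times an integrated luminosity of 30 fb$^{-1}$; the short row labels are ours.

```python
import math

LUMI = 30.0  # integrated luminosity in fb^-1

# (label, GF signal [fb], backgrounds [fb]: EW WWjj, tt, ttj, ttjj, QCD WWjj)
ROWS = [
    ("inclusive cuts",   115.2, [75.1, 6832, 9518, 1676, 363]),
    ("+ b veto",          99.2, [67.4, 833, 1822, 564, 307]),
    ("+ lepton pair",     55.8, [30.7, 104, 218, 86.4, 42.7]),
    ("+ lepton pT",       41.5, [22.3, 38.3, 87.7, 29.2, 20.5]),
    ("+ transverse mass", 37.1, [19.9, 30.1, 63.4, 19.3, 13.4]),
    ("+ missing pT",      31.5, [16.5, 23.3, 51.1, 11.2, 11.4]),
]

def significance(signal_fb: float, backgrounds_fb, lumi: float = LUMI):
    """Return (S/B, S/sqrt(B)) for the given cross sections and luminosity."""
    b = sum(backgrounds_fb)
    return signal_fb / b, signal_fb * lumi / math.sqrt(b * lumi)

for label, s, bkgs in ROWS:
    s_over_b, z = significance(s, bkgs)
    print(f"{label:18s} S/B = 1/{1.0 / s_over_b:5.1f}   S/sqrt(B) = {z:4.1f}")
```

Because $S/\sqrt{B}$ scales with the square root of the luminosity while S/B does not, later cuts can improve the purity without increasing the statistical significance.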



Examining the lepton $p_T$ distributions, it turns out that background events which survive
the angular cut of Eq. (11) have a significantly lower lepton $p_T$ than the signal and VBF
events. This is demonstrated in Fig. 5. On the left hand side the distributions of minimum
lepton $p_T$ are plotted for signal and background, for the inclusive cuts of Eq. (9). The curves
lie almost on top of each other and there seems to be no difference between the various
processes. The right hand side of Fig. 5 shows the same distributions after applying the $\Delta R_{\ell\ell}$
and $m_{\ell\ell}$ cuts. This suggests a harder lepton $p_T$ cut and we impose

$$p_{T\ell} > 30\ \mathrm{GeV} \qquad (12)$$

in the following. Note that this harder cut on the two charged leptons implies excellent
trigger efficiencies even in high luminosity running.

Figure 5: Normalized distributions of the minimum charged lepton $p_T$ for inclusive cuts of
Eq. (9) (left) and after the additional $\Delta R_{\ell\ell}$ and $m_{\ell\ell}$ cuts of Eq. (11) (right). Curves are for
signal and backgrounds as in Fig. 2.

It is known that the transverse mass of the dilepton-$\not{p}_T$ system can be used to reconstruct
the Higgs boson mass. This works particularly well for masses at or below W pair threshold.
We here use the transverse mass definition of Ref. [1],

$$m_T^{WW} = \sqrt{(\not{E}_T + E_{T,\ell\ell})^2 - (\vec p_{T,\ell\ell} + \vec{\not{p}}_T)^2} \qquad (13)$$

in terms of the invariant mass of the two charged leptons and the transverse energies

$$E_{T,\ell\ell} = (p_{T,\ell\ell}^2 + m_{\ell\ell}^2)^{1/2} , \qquad \not{E}_T = (\not{p}_T^2 + m_{\ell\ell}^2)^{1/2} . \qquad (14)$$

In Fig. 6 the GF signal and the VBF process show a pronounced Jacobian peak in the
$m_T^{WW}$ distribution whereas the $t\bar t$+jets and QCD $W^+W^-jj$ background events are broadly
distributed. Since the process of gluon induced Higgs+2 jet production will not be a discovery
channel for the Higgs boson, we can assume that the Higgs boson mass is known and we
can further optimize our cuts for this mass. Hence we impose a strict cut on the $m_T^{WW}$
observable:

$$m_T^{WW} < 170\ \mathrm{GeV} \qquad (15)$$

Considering Eqs. (13) and (14), $m_T^{WW}$ and $m_{\ell\ell}$ are apparently correlated with each other.
We find that the ratio $m_{\ell\ell}/m_T^{WW}$ contains further useful information. We apply a cut of the
form:

$$m_{\ell\ell} < 0.44 \cdot m_T^{WW} \qquad (16)$$

With these cuts the backgrounds are again strongly reduced while the Higgs induced
processes are affected at the 10% level only. The results for the cuts of Eqs. (12), (15) and (16)
are shown in lines 4 and 5 of Table 2.

Figure 6: Left: Normalized distribution of the transverse mass of the dilepton system as
defined in Eq. (13). Right: The ratio of the dilepton invariant mass $m_{\ell\ell}$ and transverse mass
$m_T^{WW}$. Curves are for signal and backgrounds as in Fig. 2 with the inclusive cuts of Eq. (9).
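
The transverse mass and the two cuts built on it can be sketched as follows. The example assumes the reading of the definition in which both transverse energies are formed with the dilepton invariant mass, $E_T = (p_T^2 + m_{\ell\ell}^2)^{1/2}$, for the dilepton system and for the missing momentum alike; transverse momenta are passed as hypothetical `(px, py)` tuples.

```python
import math

def transverse_mass_ww(pt_ll, m_ll: float, pt_miss) -> float:
    """Transverse mass of the dilepton + missing-pT system (GeV).

    pt_ll, pt_miss: transverse momentum 2-vectors (px, py) in GeV.
    m_ll: dilepton invariant mass in GeV.
    """
    et_ll = math.sqrt(pt_ll[0] ** 2 + pt_ll[1] ** 2 + m_ll ** 2)
    et_miss = math.sqrt(pt_miss[0] ** 2 + pt_miss[1] ** 2 + m_ll ** 2)
    sum_x = pt_ll[0] + pt_miss[0]
    sum_y = pt_ll[1] + pt_miss[1]
    mt2 = (et_miss + et_ll) ** 2 - (sum_x ** 2 + sum_y ** 2)
    return math.sqrt(max(mt2, 0.0))  # guard against tiny negative rounding errors

def passes_transverse_mass_cuts(m_ll: float, mt_ww: float) -> bool:
    """Cuts m_T^WW < 170 GeV and m_ll < 0.44 * m_T^WW."""
    return mt_ww < 170.0 and m_ll < 0.44 * mt_ww

# Dilepton system recoiling against the missing momentum
mt = transverse_mass_ww((60.0, 0.0), 40.0, (-60.0, 0.0))
print(f"m_T^WW = {mt:.1f} GeV, passes: {passes_transverse_mass_cuts(40.0, mt)}")
```

With this definition the transverse mass is never smaller than $2\,m_{\ell\ell}$, so the ratio $m_{\ell\ell}/m_T^{WW}$ is bounded by 0.5 and a cut at 0.44 removes only the upper edge of the allowed range.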

Table 3: Signal and background cross sections and the expected number of events for
$\mathcal{L}_{int} = 30$ fb$^{-1}$ at different levels of cuts.

                          inclusive cuts   selection cuts                 selection cuts + Eq. (21)
process                   sigma [fb]       sigma [fb]   events/30 fb^-1   sigma [fb]   events/30 fb^-1
GF pp -> H + jj           115              31.5         945               10.6         318
EW pp -> W+W- + jj        75               16.5         495               13.9         417
pp -> tt                  6830             23.3         699               1.5          45
pp -> tt + j              9520             51.1         1530              13.4         402
pp -> tt + jj             1680             11.2         336               3.8          114
QCD pp -> W+W- + jj       363              11.4         342               3.0          90
sum of backgrounds        18500            114          3410              35.6         1070

At this level the signal rate is reduced by a factor of 3 as compared to the inclusive cuts,
but the backgrounds now have cross sections of the same order as the signal. The largest
background still arises from the $t\bar t$ processes, especially $t\bar t + 1j$, i.e. one tagging jet arises from
an (unidentified) b-quark from $t$ or $\bar t$ decay and the other one is due to emission of a light quark
or gluon in $t\bar t$ production. For an integrated luminosity of $\mathcal{L}_{int} = 30$ fb$^{-1}$ the rates correspond
to a purely statistical significance of the gluon fusion signal of $S/\sqrt{B} \approx 17$. However,
additional backgrounds arise from $\ell\ell jj$ events where the missing transverse momentum is
generated by detector effects. It has been shown in Ref. [4] that these backgrounds are under
control when requiring a missing $p_T$ of at least 30 GeV:

$$\not{p}_T > 30\ \mathrm{GeV} \qquad (17)$$

The $\not{p}_T$ distributions are shown on the right hand side of Fig. 3. The GF signal and the
backgrounds which we have considered are affected similarly by this cut. The resulting cross
sections are shown in the last line of Table 2. We are left with a signal to background ratio of
one over 3.6. For $\mathcal{L}_{int} = 30$ fb$^{-1}$ we expect 945 signal events on top of 3400 background events
as summarized in Table 3. This corresponds to a statistical significance of $S/\sqrt{B} \approx 16$. Note,
however, that the top-production backgrounds need to be understood with an accuracy of 7%
or better in the signal region in order to allow for a $5\sigma$ Higgs signal from rate measurements
alone. If background uncertainties turn out to be that large one may want to reexamine the
selection. A significantly higher signal to background ratio, and a concomitant smaller effect
of background uncertainties, can be achieved by more aggressive cuts, as is obvious from the
various distributions in Figs. 4 to 7. In the absence of solid estimates for systematic errors
we have tried to optimize the statistical significance and leave further refinements to future
studies. In any case, the event rates summarized in Table 3 are sufficiently large to allow
the analysis of distributions.



Figure 7: Normalized distributions for the minimum jet-lepton invariant mass, $m_{j\ell}$ (min) [GeV].
Curves are for signal and backgrounds as in Fig. 2 with the inclusive cuts of Eq. (9).

It is clear that a cut based analysis as described above is not the optimal way to isolate
the signal from the background. More advanced techniques like neural networks or likelihood
methods should be considered. As input for such refinements we show one further observable,
which, however, could not be exploited in our cut based approach. The minimum jet-lepton
invariant mass $m_{j\ell}$, i.e. the minimum of the four combinations of the charged leptons with
the two tagging jets, is plotted in Fig. 7. The distribution shows that $m_{j\ell}$ is bounded by the
top mass for the $t\bar t$ processes. Only a small tail is left above $m_t$ for the $t\bar tjj$ process. This
characteristic remains after the selection cuts. Since the contribution from top backgrounds
is so different for $m_{j\ell} < 150$ GeV and $m_{j\ell} > 150$ GeV, it might be useful to perform separate
cut optimizations for the two regions.



4 Azimuthal angle correlations 

In order to determine the tensor structure of the effective $Hgg$ coupling, the distributions
of the two tagging jets are an important tool. The distribution $d\sigma/d|\Delta\Phi_{jj}|$ of the azimuthal
angle between the two tagging jets provides for an excellent distinction between the two
tensor structures of Eq. (3). Unfortunately, when both CP-even and CP-odd couplings
of similar strength are present, the tensor structure cannot be unambiguously determined
anymore. The missing information is contained in the sign of the azimuthal angle between the 
tagging jets [Uj. Naively one might assume that this sign cannot be defined unambiguously 
in pp collisions because an azimuthal angle switches sign when viewed along the opposite 
beam direction. However, in doing so, the "toward" and the "away" tagging jets also switch 
place, i.e. one should take into account the correlation of the tagging jets with the two 
distinct beam directions. Defining $\Delta\Phi_{jj}$ as the azimuthal angle of the "away" jet minus the
azimuthal angle of the "toward" jet, a switch of the two beam directions leaves the sign of
$\Delta\Phi_{jj}$ intact. To be precise, let us define the normalized four-momenta of the two proton
beams as $b_+$ and $b_-$, while $p_+$ and $p_-$ denote the four-momenta of the two tagging jets, where
$p_+$ points into the same detector hemisphere as $b_+$. Then

$$\varepsilon_{\mu\nu\rho\sigma}\, b_+^\mu p_+^\nu b_-^\rho p_-^\sigma = 2 p_{T,+}\, p_{T,-} \sin(\phi_+ - \phi_-) = 2 p_{T,+}\, p_{T,-} \sin\Delta\Phi_{jj} \qquad (18)$$

provides the sign of $\Delta\Phi_{jj}$. This definition is manifestly invariant under the interchange
$(b_+, p_+) \leftrightarrow (b_-, p_-)$, i.e. when viewing the event from the opposite beam direction, and we
also note that $\Delta\Phi_{jj}$ is a parity odd observable.

Figure 8: Normalized distributions of the jet-jet azimuthal angle difference as defined in
Eq. (18). The curves are for the SM CP-even case ($a_3 = 0$), a pure CP-odd ($a_2 = 0$) and a
CP-mixed case ($a_2 = a_3 \neq 0$). The cuts of Eqs. (9) and (21) were applied.
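
The invariance under a change of viewing direction can be checked with a few lines of code. The sketch below assumes that the "+" jet may be identified as the tagging jet with the larger rapidity (which coincides with the hemisphere assignment whenever the two jets lie in opposite hemispheres) and that viewing the event from the opposite beam direction maps $(\eta, \phi) \to (-\eta, -\phi)$; the function names are illustrative.

```python
import math

def signed_delta_phi(jet_a, jet_b) -> float:
    """Signed azimuthal angle difference phi_+ - phi_- in (-pi, pi].

    Each jet is a tuple (eta, phi). The '+' jet is taken to be the one
    with the larger rapidity (an assumption of this sketch).
    """
    plus, minus = (jet_a, jet_b) if jet_a[0] > jet_b[0] else (jet_b, jet_a)
    dphi = (plus[1] - minus[1]) % (2.0 * math.pi)
    return dphi - 2.0 * math.pi if dphi > math.pi else dphi

def view_from_opposite_beam(jet):
    """Rotate the coordinate system by 180 degrees about a transverse axis."""
    eta, phi = jet
    return (-eta, -phi)

j1, j2 = (2.1, 0.4), (-1.3, 1.9)
original = signed_delta_phi(j1, j2)
flipped = signed_delta_phi(view_from_opposite_beam(j1), view_from_opposite_beam(j2))
print(f"{original:.3f} {flipped:.3f}")
```

Both azimuthal angles change sign under the flip, but the roles of the two jets are exchanged at the same time, so the difference is unchanged.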

The corresponding azimuthal angle distribution is shown in Fig. 8 for the gluon fusion
Higgs signal for three scenarios of CP-even and CP-odd Higgs couplings. All three cases are
well distinguishable. The maxima in the distribution are directly connected to the size of
the parameters $a_2$ and $a_3$, which were introduced in Eqs. (3, 4). For

$$a_2 = a\cos\alpha , \qquad a_3 = a\sin\alpha , \qquad (19)$$

the positions of the maxima are at $\Delta\Phi_{jj} = \alpha$ and $\Delta\Phi_{jj} = \alpha \pm \pi$. This also explains why
$|\Delta\Phi_{jj}|$ loses information in the mixed CP case: when folding over the $\Delta\Phi_{jj}$-distribution at
$\Delta\Phi_{jj} = 0$, maxima and minima at +45 and -45 degrees cancel each other.

As noted above, $\Delta\Phi_{jj}$ is a parity odd observable. Finding a $\Delta\Phi_{jj}$ asymmetry as in Fig. 8
would show that parity is violated in the process $pp \to HjjX$. Since the QCD couplings
are parity conserving, the parity violation must originate from a parity-odd Higgs coupling,
namely $a_3$ in the effective $Hgg$ vertex. This term is also CP-odd. Such a coupling, occurring
at the same time as the CP-even SM coupling $a_2$, implies CP-violation in the Higgs sector.
In this sense, the observation of an asymmetry in the $\Delta\Phi_{jj}$ distribution would directly
demonstrate CP-violation in the Higgs sector.




Figure 9: The $\Delta\Phi_{jj}$ distribution for a pure CP-even coupling (left) and a pure CP-odd
coupling (right) for $\mathcal{L}_{int} = 300$ fb$^{-1}$. From top to bottom: GF signal, EW $W^+W^-jj$, $t\bar t$, $t\bar tj$,
$t\bar tjj$, and QCD $W^+W^-jj$ backgrounds.

The azimuthal angle difference $\Delta\Phi_{jj}$ of the two tagging jets is the observable that carries
information about the CP nature of the $Htt$ coupling. After we have improved the $S/\sqrt{B}$
ratio as much as possible by means of selection cuts, we now want to extract this information
from the $\Delta\Phi_{jj}$ distribution. For this purpose we define a fit function

$$f(\Delta\Phi) = N\,(1 + A\cos[2(\Delta\Phi - \Delta\Phi_{max})] - B\cos(\Delta\Phi)) , \qquad (20)$$

with fit-parameters $A$, $\Delta\Phi_{max}$, $B$, $N$. The parameter $N$ is a normalization factor and $B$ is
an overall shape-factor. The parameters $A$ and $\Delta\Phi_{max}$ are the physically relevant ones. $A$
describes the relative magnitude of the angle correlation and thus the significance of this
measurement with respect to the background fluctuations. $\Delta\Phi_{max}$ gives the position of the
first maximum, which measures the relative strength of CP-even and CP-odd couplings.
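
The role of the individual parameters can be illustrated by evaluating the fit function directly. The sketch below only evaluates and scans the function; it does not perform a fit, and the grid scan and function names are our own illustrative choices. Angles are taken in degrees.

```python
import math

def fit_function(dphi_deg: float, n: float, a: float, dphi_max_deg: float, b: float) -> float:
    """f = N * (1 + A*cos[2*(dphi - dphi_max)] - B*cos(dphi)), angles in degrees."""
    x = math.radians(dphi_deg)
    x0 = math.radians(dphi_max_deg)
    return n * (1.0 + a * math.cos(2.0 * (x - x0)) - b * math.cos(x))

def scan_maximum(n: float, a: float, dphi_max_deg: float, b: float, step: float = 0.5) -> float:
    """Position (degrees) of the largest function value on a grid over [-90, 90)."""
    grid = [-90.0 + step * k for k in range(int(180.0 / step))]
    return max(grid, key=lambda d: fit_function(d, n, a, dphi_max_deg, b))

# Without the shape term (B = 0) the maximum sits exactly at dphi_max
print(scan_maximum(n=1.0, a=0.2, dphi_max_deg=45.0, b=0.0))
```

Since the $\cos[2(\Delta\Phi - \Delta\Phi_{max})]$ term has period 180 degrees, a purely CP-odd distribution can equivalently be described with $\Delta\Phi_{max} = 90$ degrees or with $\Delta\Phi_{max} = 0$ and the opposite sign of $A$.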




Figure 10: The $\Delta\Phi_{jj}$ distribution for CP-mixed couplings as in Fig. 9 for $\mathcal{L}_{int} = 300$ fb$^{-1}$.
Left: $y_t = \tilde{y}_t = y_t^{SM}$. Right: $y_t = -\tilde{y}_t = y_t^{SM}$.

Figure 9 shows the expected $\Delta\Phi_{jj}$ distribution for a purely CP-even ($y_t = y_t^{SM}$, $\tilde{y}_t = 0$)
and a purely CP-odd Htt coupling ($y_t = 0$, $\tilde{y}_t = y_t^{SM}$) for an integrated luminosity of 300 fb$^{-1}$.
Plotted are signal events on top of the various backgrounds. The black curve is the fit to this
distribution with the function of Eq. (20). A comparison with Fig. 8 shows that the
characteristic angular correlation is kept but diluted by the background. However, with
the help of the fit we can extract the parameters $A$ and $\Delta\Phi_{max}$. In order to estimate
the statistical significance of the measurement, we divide our Monte Carlo sample into 10
independent samples of data, each corresponding to an integrated luminosity of 30 fb$^{-1}$.
Averaging the results and the errors, we obtain $A = 0.115 \pm 0.039$, $\Delta\Phi_{max} = -3.0 \pm 10.7$
for the CP-even coupling and $A = 0.210 \pm 0.034$, $\Delta\Phi_{max} = 90.4 \pm 4.7$ for the CP-odd
coupling, where errors are purely statistical. The expected values would be $\Delta\Phi_{max} = 0$ and
$\Delta\Phi_{max} = 90$, respectively. Since the difference between the CP-even and the CP-odd case can
also be expressed as a flip in the sign of $A$ (keeping $\Delta\Phi_{max} = 0$), this result means that a CP-odd
Htt coupling can be distinguished from the SM case with approximately $6\sigma$ significance
for 30 fb$^{-1}$, assuming a perfect detector. Since the backgrounds are fairly flat in $\Delta\Phi_{jj}$, any
background normalization uncertainties are largely absorbed into the fit parameters $N$ and
$B$. Our estimates for the significance of the $A$ determination will therefore receive only minor
changes due to such systematic errors.
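
The size of the quoted significance can be checked with simple arithmetic on the fit results above. The text does not state how the significance was computed; the sketch below assumes that the CP-odd case is written as a sign flip of $A$ at $\Delta\Phi_{max} = 0$ and that the two statistical errors are independent and combined in quadrature.

```python
import math

# Fit results quoted in the text for 30 fb^-1 (statistical errors only).
A_even, err_even = 0.115, 0.039   # CP-even case, dphi_max near 0
A_odd, err_odd = 0.210, 0.034     # CP-odd case, dphi_max near 90 degrees

# With dphi_max fixed to 0, the CP-odd modulation corresponds to A -> -A.
A_odd_signed = -A_odd

# Assumption: independent errors, combined in quadrature.
separation = A_even - A_odd_signed
combined_err = math.hypot(err_even, err_odd)
significance = separation / combined_err

print(f"separation = {separation:.3f}, error = {combined_err:.3f}, "
      f"significance = {significance:.1f} sigma")
```

Under these assumptions the separation of 0.325 corresponds to roughly 6.3 standard deviations, in line with the approximate figure given in the text.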
Fig. 10 shows the azimuthal angle distribution for the case of a CP-mixed coupling
$y_t = \pm\tilde{y}_t = y_t^{SM}$. Applying the method described above, we extract $A = 0.260 \pm 0.031$,
$\Delta\Phi_{max} = 59.3 \pm 3.5$ from the left part of Fig. 10 and $A = 0.246 \pm 0.031$, $\Delta\Phi_{max} = -58.4 \pm 3.7$
from the right one. The expected values would be $\Delta\Phi_{max} = \pm\arctan(\frac{3}{2}) = \pm 56.3$.
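
As a quick numerical cross-check, the sketch below compares the fitted phases with the expectation. It assumes that the argument of the arctangent is 3/2, the value that reproduces the quoted 56.3 degrees, and it expresses each deviation in units of the quoted statistical error.

```python
import math

# Assumption: tan(dphi_max) = 3/2 for couplings of equal magnitude, chosen
# because it reproduces the 56.3 degrees quoted in the text.
expected = math.degrees(math.atan(1.5))

# Fitted values and statistical errors quoted for the two panels of Fig. 10.
fits = {"left": (59.3, 3.5, +expected), "right": (-58.4, 3.7, -expected)}

pulls = {}
for panel, (value, error, target) in fits.items():
    pulls[panel] = (value - target) / error
    print(f"{panel}: expected {target:+.1f}, fitted {value:+.1f} +- {error}, "
          f"pull {pulls[panel]:+.2f}")
```

Both fitted phases lie within one standard deviation of the expectation under this assumption.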




Figure 11: The $\Delta\Phi_{jj}$ distribution of the signal for different intervals of the jet rapidity
separation $\Delta\eta_{jj}$ as labeled in the left plot. Left: CP-even coupling. Right: CP-odd coupling.

For the procedure described above, we have applied an additional cut on the dijet rapidity 
separation of 

$$ \Delta\eta_{jj} = |\eta_{j1} - \eta_{j2}| > 3.0 . \qquad (21) $$

This is advantageous because the analyzing power of the $\Delta\Phi_{jj}$ distribution strongly depends
on the rapidity separation of the two tagging jets. This is demonstrated in Fig. 11, which
shows the distributions within different $\Delta\eta_{jj}$ intervals for the CP-even and the CP-odd case.
For low $\Delta\eta_{jj}$ values the $\Delta\Phi_{jj}$ distribution does not show the characteristic features discussed
above. However, they get more pronounced as $\Delta\eta_{jj}$ increases. Therefore, a cut on $\Delta\eta_{jj}$ raises
the analyzing power of the azimuthal angle distribution. Unfortunately, an additional cut
also decreases the number of signal events and thus the significance of the measurement. It
turns out that a cut value around $\Delta\eta_{cut} \approx 3$ leads to a minimal error on $\Delta\Phi_{max}$ and an
optimal $\Delta A/A$ ratio.
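
The selection itself is a one-line requirement on the two tagging-jet rapidities. The sketch below applies it to a few made-up events; the function name `passes_rapidity_gap` and the jet rapidities are hypothetical and serve only to show how the cut trades event count for analyzing power.

```python
def passes_rapidity_gap(eta_j1, eta_j2, cut=3.0):
    """Return True if the tagging-jet pair satisfies |eta_j1 - eta_j2| > cut, as in Eq. (21)."""
    return abs(eta_j1 - eta_j2) > cut

# Hypothetical tagging-jet rapidities, for illustration only.
toy_events = [(2.1, -1.4), (0.8, -0.9), (-2.6, 1.0), (1.5, 3.9)]
selected = [ev for ev in toy_events if passes_rapidity_gap(*ev)]
efficiency = len(selected) / len(toy_events)
print(selected, efficiency)
```

Raising `cut` keeps only widely separated jet pairs, which carry the stronger correlation, at the price of a lower efficiency.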

Our analysis has been performed with parton level Monte Carlo programs based on
leading order matrix elements for the signal and the backgrounds. In the similar situation of
widely separated dijets in $p\bar{p}$ collisions at the Tevatron, some de-correlation in the azimuthal
angle distribution is predicted at higher order at the parton level [23, 24, 25, 26, 27] and
with parton showers and hadronisation [28, 29], and has been measured at the Tevatron [30]. A
strong de-correlation, as originally predicted in a pure parton shower approach in Ref. [31],
would invalidate our method for Higgs CP studies. Using full leading order QCD matrix
elements for H + 2- and H + 3-parton production and additional parton shower simulation,
the question was reanalyzed recently [32] and a de-correlation at the 25% level was found
for rapidity separations $\Delta\eta_{jj} > 4.2$, which would imply a corresponding reduction of $A$ in
Eq. (20). An even smaller effect was found in the NLO calculation of Campbell et al. [10],
raising the possibility that part of the de-correlation found in Ref. [32] is due to tagging jets
originating from the parton shower, which is generated flat in azimuthal angle. Considering
these effects we expect a reduction of 25% or less in analyzing power due to higher order
effects. A quantitative determination of the de-correlation requires further study, however.
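
To give a feeling for what a reduction of this size would mean, the sketch below scales the leading-order amplitudes quoted earlier by the full 25%. It is a rough illustration only: it assumes that $A$ scales by a common factor of 0.75 and that the statistical errors on $A$ are unchanged, neither of which is established in the text.

```python
dilution = 0.75  # assumption: A reduced by the full 25% quoted as an upper estimate

# Leading-order fit results quoted in the text for 30 fb^-1: (A, statistical error).
lo_fits = {"CP-even": (0.115, 0.039), "CP-odd": (0.210, 0.034)}

diluted = {}
for label, (A, err) in lo_fits.items():
    A_diluted = dilution * A
    # Assumption: the statistical error on A is unaffected by the dilution.
    diluted[label] = (A_diluted, A / err, A_diluted / err)
    print(f"{label}: A = {A:.3f} -> {A_diluted:.3f}, "
          f"A/dA = {A / err:.1f} -> {A_diluted / err:.1f}")
```

Under these assumptions the CP-odd amplitude would still be measured at more than four standard deviations with 30 fb$^{-1}$.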



5 Conclusions 

Both sources of Higgs plus dijet events at the LHC, vector boson fusion and gluon fusion, can
provide important information on Higgs boson properties. Vector boson fusion is sensitive
to the couplings of the Higgs boson to weak bosons, while gluon fusion measures the effective Hgg
coupling, which, within the SM, is mostly induced by the Htt Yukawa coupling. For a Higgs
boson mass of 160 GeV and SM couplings, the decay channel $H \to WW \to l^+l^-\not{p}_T$ provides a
highly significant signal above backgrounds, which are dominated by top quark pair production,
for both vector boson fusion [3] and gluon fusion. The distinction of the two
Hjj channels is most easily achieved, at the statistical level, by using characteristically
different distributions of the two tagging jets, mainly their rapidity separation and the dijet
invariant mass [33]. A central jet veto can be used to further enhance vector boson fusion
over gluon fusion events [3].

A further analysis of the structure of the Htt Yukawa coupling, in particular the question
whether this coupling is CP-even or CP-odd or a mixture of the two, is possible via
the azimuthal angle correlation of the two tagging jets. Taking into account the rapidity
correlations of the tagging jets with the beam directions, the sign of this azimuthal angle
can be defined [11] and the resulting full distribution in azimuthal angle separation $\Delta\Phi_{jj}$
exhibits the relative strength of CP-even and CP-odd couplings via a phase shift. The $\Delta\Phi_{jj}$
distribution is sensitive to CP violation in the Higgs sector. For the case analyzed here,
$m_H = 160$ GeV with SM size production cross section in gluon fusion and SM-like decay
branching fractions, a highly significant measurement is expected with an integrated luminosity
well below 100 fb$^{-1}$. A precise determination of the analyzing power requires a full
detector simulation, however, which is beyond the scope of the present paper.

The methods presented here for the particular case of $m_H = 160$ GeV can be extended to
other Higgs boson masses. For 150 GeV $< m_H < 2m_Z$ the present analysis should be readily
applicable, with minor modifications, such as in the transverse mass cut of Eq. (15). Lower
Higgs boson masses may require somewhat softer lepton cuts and will eventually run into
statistical problems due to the smaller branching ratio for $H \to WW$ decay as compared to
$m_H = 160$ GeV. The azimuthal angle correlations of the tagging jets are independent of the
specific Higgs decay mode, of course. This raises the question whether $H \to \tau\tau$ or $H \to \gamma\gamma$
signals from gluon fusion induced Hjj events are observable at the LHC. Given the much
more difficult task of QCD background reduction as compared to vector boson fusion studies
which we have found in this paper for the "easy" $m_H = 160$ GeV case, we expect such novel
Higgs signals from gluon fusion induced Hjj events and the study of their azimuthal angle
correlations to be quite challenging, but worth considering.